DETAILED ACTION

Continued Examination Under 37 CFR 1.114 

1. A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1.17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
10/09/2008 has been entered. 

2. Claims 1, 7, 12, 16, 19, 23, 37 and 48-50 have been amended. Claims 51-57 have
been added.

3. The examiner is not making a 35 U.S.C. 101 rejection because the "memory" is 
defined as tangible components in Fig. 4 [52 & 54]. 

Information Disclosure Statement 

4. The information disclosure statements filed on 10/9/2008 & 11/7/2008 have been 
considered. 

Response to Arguments 

5. Applicant's arguments filed 10/9/2008 have been fully considered but they are 
not persuasive. 

6. In response to applicant's argument that the references fail to show certain 
features of applicant's invention, it is noted that the features upon which applicant relies 
(i.e., "Wang does not teach an apparatus configured to extract lower level features that
may be later used to extract higher level features off-apparatus to identify a media from 
a media sample", "a remote service" "to extract higher level features from the received 
lower level features to uniquely identify a media corresponding to the media sample" 
Page 12) are not recited in the rejected claims 1, 23, 48 and 50. Although the claims 
are interpreted in light of the specification, limitations from the specification are not read 
into the claims. See In re Van Geuns, 988 F.2d 1181, 26 USPQ2d 1057 (Fed. Cir. 
1993). 

Claim Rejections - 35 USC § 103 

7. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

8. Claims 1-9, 12, 16, 17, 22-27, 30, 35, 38, 40 and 47-55 are rejected under 35 
U.S.C. 103(a) as being unpatentable over Wang et al. (US-6,990,453 hereinafter, 
Wang) in view of Barton et al. (US 2002/0072982 hereinafter, Barton). 

Regarding claim 1, Wang teaches an apparatus (Col. 7 line 67 through Col. 8 line 
5) comprising: 

an interface configured to receive a media sample; (Fig. 1 [12] and Col. 5 line 36 
through Col. 6 line 15)

a processor configured to extract a first set of lower level but not higher level
features from a digital version of the media sample; (Col. 6 lines 14-34 and Fig. 1 [14]) 

a transmitter configured to transmit the extracted first set of lower level but not 
higher level features over a wireless communication link; (Col. 6 line 61 through Col. 7 
line 11 and Col. 8 lines 5-24) 

and inherently includes a receiver for receiving, over a wireless communication
link (Col. 7 line 67 through Col. 8 line 5 "mobile phone"), a notification as to
identification of a media corresponding to the media sample from the remote
service. (Col. 7 lines 57-59, Col. 16 lines 55-56 and Col. 17 lines 31-34)

Wang differs from the claimed invention by not explicitly reciting the receiver is 
for receiving a request message over the wireless link that requests at least one 
additional feature and the processor is automatically responsive to the request message 
to extract a second set of features from the digital version of the media sample and the 
transmitter is further to transmit the extracted second set of lower level but not higher 
level features. 

In an analogous art, Barton teaches a system for identifying audio samples that 
includes a recursive feature for automatically requesting more information in order to 
narrow the search results to find the corresponding file and transmit the results. (Page 
5 [0048 and 0049] the "resolution of the derivation is coupled, in large measure, to the 
level of discrimination required in selecting an event to be triggered. As the number of 
potentially triggered events increases, the necessity to resolve ambiguity in the sample 
also increases", Page 6 [0059] "the song excerpt may be increased in length, or a 
different excerpt may be furnished, in an iterative manner" until a song is identified,
Page 7 [0067-0068] and Page 11 [0138]) At the time the invention was made, it would 
have been obvious to one of ordinary skill in the art to implement the mobile station of 
Wang after modifying it to incorporate the ability to increase resolution to resolve 
ambiguity of Barton. One of ordinary skill in the art would have been motivated to do 
this since it enables back and forth communication to resolve ambiguity. (Page 5 [0048- 
0049], Page 6 [0059] and Page 7 [0067-0068]) 

Regarding claim 2, Wang in view of Barton teaches the interface comprises a 
transducer. (Wang Col. 7 line 67 through Col. 8 line 5) 

Regarding claim 3, Wang in view of Barton teaches the transducer comprises a 
microphone and the media sample comprises an audio sample. (Wang Col. 5 lines 36- 
59, Col. 7 line 67 through Col. 8 line 5, Fig. 1 [12] and Col. 15 lines 25-58) 

Regarding claim 4, Wang in view of Barton teaches the transducer comprises a 
camera and the media sample comprises a visual sample. (Wang Col. 5 lines 36-59) 

Regarding claim 5, Wang in view of Barton obviously teaches the interface 
comprises one of a cable and a wireless link. (Wang Col. 7 line 67 through Col. 8 line 5 
and Col. 15 lines 25-58) 

Regarding claim 6, Wang in view of Barton teaches the media sample that the 
interface receives is the digital version. (Wang Col. 15 lines 25-58) 

Regarding claim 7, Wang in view of Barton teaches the transmitter is further 
configured to transmit a message that includes the at least one extracted lower level but 
not higher level feature and no portion of the digital version of the media sample. 
(Wang Col. 4 lines 23-32 [LPC coefficients and frequency components of spectrogram
peaks]) 

Regarding claim 8, Wang in view of Barton teaches the processor is further 
configured to adaptively select a number of lower level but not higher level features to 
extract based on the digital version of the media sample. (Wang Col. 4 lines 23-32 
[LPC coefficients and frequency components of spectrogram peaks]) 

Regarding claim 9, Wang in view of Barton teaches the processor is further 
configured to adaptively select at least one type of feature to extract based on the digital 
version of the media sample, the processor extracts at least one feature of the 
adaptively selected type, and wherein the transmitter is further configured to transmit an 
identifier of the selected type of feature. (Wang Col. 4 lines 15-41 and Col. 7 line 3 through
Col. 8 line 24) 

Regarding claim 12, Wang in view of Barton teaches a user interface for causing the
transmitter to transmit the first set of lower level but not higher level features, and a 
buffer to store at least a portion of the digital version of the media sample, wherein the 
processor extracts at least some of the first set prior to a user input at the said user 
interface. (Wang Col. 21 line 57 through Col. 22 line 50) 

Regarding claim 16, Wang in view of Barton teaches a user interface (Wang Col. 
5 lines 36-59 and Col. 7 line 67 through Col. 8 line 24) by which a single user input 
initiates: 

the processor to extract the first set of lower level but not higher level features, a 
wireless communications link to be established between the MS and a communication 
service, and the extracted first set of lower level but not higher level features to be
transmitted over the wireless communications link. (Wang Col. 6 line 61 through Col. 7 
line 36 and Col. 7 line 67 through Col. 8 line 24) 

Regarding claim 17, Wang in view of Barton teaches the single user input further 
initiates a buffer disposed between the transducer and the processor to begin storing at 
least a portion of the digital version of the media sample. (Wang Col. 15 line 25 through 
Col. 16 line 2) 

Regarding claim 22, Wang in view of Barton teaches the request message 
specifically identifies each additional feature at least by type and the second set of 
features comprises only features of the said identified type. (Wang Col. 15 line 59 
through Col. 16 line 2) 

Regarding claim 23, Wang in view of Barton teaches a computer readable 
medium embodied with a computer program (Wang Col. 7 line 67 through Col. 8 line 
24) comprising: 

a first set of computer instructions to extract in response to a user input, a first 
set of lower level but not higher level features from a digital media sample (Wang Col. 6 
lines 14-34 and Fig. 1 [14]), and to extract in response to a received request message 
to the device from a remote service, through wireless communications, a second set of 
lower level but not higher level features consistent with additional features that are 
requested in the request message, (Barton Page 5 [0048 and 0049] the "resolution of 
the derivation is coupled, in large measure, to the level of discrimination required in 
selecting an event to be triggered. As the number of potentially triggered events 
increases, the necessity to resolve ambiguity in the sample also increases", Page 6
[0059] "the song excerpt may be increased in length, or a different excerpt may be 
furnished, in an iterative manner" until a song is identified, Page 7 [0067-0068] and 
Wang Col. 7 line 67 through Col. 8 line 21 i.e. receiving the file fingerprints over multiple 
messages) and 

a second set of computer instructions to transmit in separate messages (Wang 
Col. 7 lines 3-11 and Col. 8 lines 16-21) the first and second sets of extracted lower 
level but not higher level features over a wireless communication link to a remote 
service; (Wang Col. 7 line 67 through Col. 8 line 24) and 

a third set of computer instructions for receiving a notification of identification of a 
media corresponding to the media sample from the remote service. (Wang Col. 7 lines 
57-59, Col. 16 lines 55-56, Col. 17 lines 31-34 and Barton Page 7 [0067-0068]) 

Regarding claim 24, the limitations of claim 24 are rejected for the same
reason set forth above for claim 7.

Regarding claim 25, the limitations of claim 25 are rejected for the same
reason set forth above for claim 8.

Regarding claim 26, the limitations of claim 26 are rejected for the same
reason set forth above for claim 9.

Regarding claim 27, Wang in view of Barton teaches the ability to transmit 
extracted features and time-bounded segments. (Wang Col. 6 line 61 through Col. 7 
line 11)

Regarding claim 30, Wang in view of Barton teaches at least one feature defines
a timepoint, the first set of computer instructions is to extract at least one timepoint from 
the digital media sample, and one of said messages comprises a timepoint, a spectral 
slice of the digital media sample and an identifier that links the spectral slice to the 
timepoint. (Wang Fig. 8A and Col. 6 line 35 through Col. 7 line 36, Col. 8 line 61 
through Col. 9 line 32 and Col. 21 lines 13-29) 

Regarding claim 35, the limitations of claim 35 are rejected for the same
reason set forth above for claim 19.

Regarding claim 38, Wang in view of Barton teaches each feature set is 
associated with a media file title (Wang Fig. 8B), the computer program further 
comprising a fifth set of computer instructions to transmit, over the network to a sender 
of the message, a reply message that includes the media file title. (Wang Fig. 1 [22] 
and Col. 6 lines 35-60) 

Regarding claim 40, Wang in view of Barton teaches the fourth set of computer 
instructions further is to determine a link address for a media file uniquely associated 
with the uniquely identified feature set, and wherein the fifth set of computer instructions 
is further to transmit the link address in the reply message. (Barton Page 2 [0022- 
0023]) 

Regarding claim 47, Wang in view of Barton teaches the request message 
includes at least one of a number of additional features and a type of the at least one 
additional feature. (Wang Col. 12 line 38 through Col. 13 line 10 and Barton Page 5 
[0048-0049])

Regarding claim 48, the limitations of claim 48 are rejected for the same
reasons set forth above for claim 23.

Regarding claim 49, Wang in view of Barton teaches the means for receiving a 
media sample comprises a transducer, and the means for extracting comprises a digital 
processor. (Wang Col. 7 line 12 through Col. 8 line 5 and Col. 15 lines 53-55) 

Regarding claim 50, the limitations of claim 50 are rejected for the same
reasons set forth above for claim 1.

Regarding claim 51, Wang in view of Barton teaches the sampling device is a 
phone or PDA (Barton Page 1 [0007 & 0012]) and it is obvious to one of ordinary skill in 
the art to recognize that a phone/PDA has buttons that can be configured to initiate the 
identification of media from the media sample. 

Regarding claim 52, the limitations of claim 52 are rejected for the same
reason set forth above for claim 51.

Regarding claim 53, the limitations of claim 53 are rejected for the same
reason set forth above for claim 51.

Regarding claim 54, the limitations of claim 54 are rejected for the same
reason set forth above for claim 51.

Regarding claim 55, Wang in view of Barton teaches providing a link, after the 
remote service identifies the media, that, when pressed, accesses a music service for 
downloading the media. (Barton Page 6 [0054-0057]) 

9. Claims 18-20, 37, 56 and 57 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Wang et al. (US-6,990,453 hereinafter, Wang) in view of Barton et al. 
(US 2002/0072982 hereinafter, Barton) and Vetro et al. (US-6,490,320 hereinafter,
Vetro). 

Regarding claim 18, Wang in view of Barton teaches the limitations of claim 1 
above, but differs from the claimed invention by not explicitly reciting the first and 
second sets of features comprise MPEG-7 descriptors. 

In an analogous art, Vetro teaches extracting MPEG-7 descriptors from the 
digital version of the media sample. (Vetro Col. 4 lines 35-37, Col. 21 lines 57-67 and 
Col. 22 lines 14-33) At the time the invention was made, it would have been obvious to 
one of ordinary skill in the art to implement the recognition system of Wang in view of 
Barton after modifying it to incorporate the MPEG-7 descriptors of Vetro in order to 
better describe the content within the file in order to simplify searches. (Col. 4 line 64 
through Col. 5 line 6) One of ordinary skill in the art would have been motivated to do 
this since knowing the content of the file enables efficient distribution of the content on a 
network. 

Regarding claim 19, Wang in view of Barton and Vetro teaches the processor 
extracts MPEG-7 file information that are lower level but not higher level features and 
are non-reconstructive of the digital version of the media sample. (Vetro Col. 4 line 64 
through Col. 5 line 6 & Col. 22 lines 30-33) 

Regarding claim 20, Wang in view of Barton and Vetro teaches the extracted 
features (Vetro Col. 4 line 64 through Col. 5 line 6 & Col. 22 lines 30-33) for which the 
transmitter is to transmit are non-reconstructive of the digital version of the media 
sample. (Vetro Col. 4 line 64 through Col. 5 line 6 & Col. 22 lines 30-33)

Regarding claim 37, Wang in view of Barton teaches a computer program
embodied on a computer readable medium to uniquely match a plurality of extracted 
features to a feature set stored in a database comprising: 

a first set of computer instructions to separately receive over a network to a 
remote service from a device through wireless communications a first and second 
message that includes first and second sets of received features (Wang Col. 7 lines 3- 
11 and Col. 7 line 67 through Col. 8 line 24), respectively, where the first set is lower 
level but not higher level features; 

a second set of computer instructions to search a database of feature sets for all 
matching sets that match the first set of received features and to determine a second 
set of at least one additional feature that distinguishes among each of the matching sets 
(Barton Page 5 [0048-0049]) 

a third set of computer instructions to transmit over the network a request 
message that stipulates the second set of lower level but not higher level additional 
features; (Barton Page 5 [0048 and 0049] the "resolution of the derivation is coupled, in 
large measure, to the level of discrimination required in selecting an event to be 
triggered. As the number of potentially triggered events increases, the necessity to 
resolve ambiguity in the sample also increases", Page 6 [0059] "the song excerpt may 
be increased in length, or a different excerpt may be furnished, in an iterative manner" 
until a song is identified and Page 7 [0067-0068]) and 

a fourth set of computer instructions to uniquely identify one feature set from 
among the matching sets using the second set of received features. (Barton Page 5 
[0048-0051], Page 6 [0059], Page 7 [0067-0068] and Wang Col. 16 line 45 through Col.
17 line 39) 

Wang in view of Barton teaches requesting at least one additional feature (Barton 
Page 5 [0048 and 0049] the "resolution of the derivation is coupled, in large measure, to 
the level of discrimination required in selecting an event to be triggered. As the number 
of potentially triggered events increases, the necessity to resolve ambiguity in the 
sample also increases", Page 6 [0059] "the song excerpt may be increased in length, or 
a different excerpt may be furnished, in an iterative manner" until a song is identified 
and Page 7 [0067-0068]), but differs from the claimed invention by not explicitly reciting 
extracting higher level features from the received lower level features to identify the 
media corresponding to the media sample. 

In an analogous art, Vetro teaches an adaptable bitstream video delivery system 
for MPEG-7 applications that includes generating low-level meta data for describing a 
file (Col. 4 lines 35-37) and high level description schemes that combine several low- 
level descriptors (Col. 4 lines 44-46) but do not describe how to generate the content. 
(Col. 22 lines 30-33 i.e. the descriptors describe a file and a description scheme is 
made up of several descriptors, but is not extracted directly from the media sample) At 
the time the invention was made, it would have been obvious to one of ordinary skill in 
the art to implement the mobile station of Wang in view of Barton after modifying it to 
incorporate the high level description schemes of Vetro in order to better describe the 
content within the file in order to simplify searches. (Col. 4 line 64 through Col. 5 line 6) 
One of ordinary skill in the art would have been motivated to do this since knowing the
content of the file enables efficient distribution of the content on a network. 

Regarding claim 56, Wang in view of Barton teaches an apparatus (Wang Col. 7 
line 67 through Col. 8 line 5) comprising: 

a receiver configured to receive over a network from a device a first message 
that includes a first set of lower level features extracted from a media sample; (Wang 
Fig. 1 [12] and Col. 5 line 36 through Col. 6 line 15 and Fig. 1 [14]) 

a transmitter configured to request over the network from the device at least one 
additional feature in a request message, wherein the receiver is configured to receive 
over the network from the device a second message responsive to the request 
message, the second message including a second set of lower level but not higher level 
features extracted from the media sample; (Barton Page 5 [0048 and 0049] the
"resolution of the derivation is coupled, in large measure, to the level of discrimination 
required in selecting an event to be triggered. As the number of potentially triggered 
events increases, the necessity to resolve ambiguity in the sample also increases", 
Page 6 [0059] "the song excerpt may be increased in length, or a different excerpt may 
be furnished, in an iterative manner" until a song is identified, Page 7 [0067-0068] and 
Wang Col. 7 line 67 through Col. 8 line 21 i.e. receiving the file fingerprints over multiple 
messages) and 

a processor configured to use the received lower level features to identify a 
media corresponding to the media sample, wherein the transmitter is configured to 
transmit a notification of the identification of the media sample to the device. (Wang 



Application/Control Number: 10/810,924 Page 15 

Art Unit: 2617 

Col. 7 lines 57-59, Col. 16 lines 55-56, Col. 17 lines 31-34 and Barton Page 7 [0067- 
0068]) 

Wang in view of Barton teaches requesting at least one additional feature (Barton 
Page 5 [0048 and 0049] the "resolution of the derivation is coupled, in large measure, to 
the level of discrimination required in selecting an event to be triggered. As the number 
of potentially triggered events increases, the necessity to resolve ambiguity in the 
sample also increases", Page 6 [0059] "the song excerpt may be increased in length, or 
a different excerpt may be furnished, in an iterative manner" until a song is identified 
and Page 7 [0067-0068]), but differs from the claimed invention by not explicitly reciting 
extracting higher level features from the received lower level features to identify the 
media corresponding to the media sample. 

In an analogous art, Vetro teaches an adaptable bitstream video delivery system 
for MPEG-7 applications that includes generating low-level meta data for describing a 
file (Col. 4 lines 35-37) and high level description schemes that combine several low- 
level descriptors (Col. 4 lines 44-46) but do not describe how to generate the content. 
(Col. 22 lines 30-33 i.e. the descriptors describe a file and a description scheme is 
made up of several descriptors, but is not extracted directly from the media sample) At 
the time the invention was made, it would have been obvious to one of ordinary skill in 
the art to implement the mobile station of Wang in view of Barton after modifying it to 
incorporate the high level description schemes of Vetro in order to better describe the 
content within the file in order to simplify searches. (Col. 4 line 64 through Col. 5 line 6) 
One of ordinary skill in the art would have been motivated to do this since knowing the
content of the file enables efficient distribution of the content on a network. 

Regarding claim 57, the limitations of claim 57 are rejected for the same
reasons set forth above for claims 37 & 56.

Conclusion 

Any inquiry concerning this communication should be directed to MATTHEW 
SAMS at telephone number (571) 272-8099.



/Lester Kincaid/ 

Supervisory Patent Examiner, Art Unit 2617 



